Localizing Text and Symbols in Images from Biomedical Journal Articles
نویسنده
چکیده
Automatic localization and recognition of text and symbols in images found in biomedical journal articles could significantly improve indexing and retrieval of biomedical literature, thus contributing to clinical decision support. Main difficulties in automatic localization of text and symbols in medical images are in the irregularity of their occurrence and in the variety of font features. The difficulties are compounded by image quality, image background interference, arbitrary location, and variability in the text block size. We present results of automatic localization and annotation of text and symbols in medical images. Our methods take advantage of gross image features and automatically identified image modality (classification of images into 4 broad types: color, illustration, radiographic and other.) 2D adaptive noise removal Wiener filtering is used as preprocessing step to reduce the image noise. Automatic histogram thresholding, morphological method, Quadtree technique, DCT, and connected component analysis are selectively used on different image types for extracting text and symbol locations. Text area merging and region growth techniques are used as post-processing methods to improve the precision of the bounding box locations. Initial experiments on 100 images achieve precision and recall of 78.42% and 89.38%, respectively, with an average accuracy of 72.02%.
منابع مشابه
Natural scene text localization using edge color signature
Localizing text regions in images taken from natural scenes is one of the challenging problems dueto variations in font, size, color and orientation of text. In this paper, we introduce a new concept socalled Edge Color Signature for localizing text regions in an image. This method is able to localizeboth Farsi and English texts. In the proposed method rst a pyramid using diff...
متن کاملAutomatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrieval
Biomedical images are often referenced for clinical decision support (CDS), educational purposes, and research. They appear in specialized databases or in biomedical publications and are not meaningfully retrievable using primarily textbased retrieval systems. The task of automatically finding the images in an article that are most useful for the purpose of determining relevance to a clinical s...
متن کاملBiomedical article retrieval using multimodal features and image annotations in region-based CBIR
Biomedical images are invaluable in establishing diagnosis, acquiring technical skills, and implementing best practices in many areas of medicine. At present, images needed for instructional purposes or in support of clinical decisions appear in specialized databases and in biomedical articles, and are often not easily accessible to retrieval tools. Our goal is to automatically annotate images ...
متن کاملExploring use of images in clinical articles for decision support in evidence-based medicine
Essential information is often conveyed pictorially (images, illustrations, graphs, charts, etc.) in biomedical publications. A clinician’s decision to access the full text when searching for evidence in support of clinical decision is frequently based solely on a short bibliographic reference. We seek to automatically augment these references with images from the article that may assist in fin...
متن کاملAutomatic Detection of Arrow Annotation Overlays in Biomedical Images
Images in biomedical articles are often referenced for clinical decision support, educational purposes, and medical research. Authors-marked annotations such as text labels and symbols overlaid on these images are used to highlight regions of interest which are then referenced in the caption text or figure citations in the articles. Detecting and recognizing such symbols is valuable for improvi...
متن کامل